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Abstract 

We extend a recently developed exact schema 
based, or coarse grained, formulation of ge- 
netic dynamics jl2|, |l3|, Q and its associated 
exact Schema theorem to an arbitrary selec- 
tion scheme and a general crossover operator. 
We show that the intuitive "building block" 
interpretation of the former is preserved lead- 
ing to hierarchical formal solutions of the 
equations that upon iteration lead to new re- 
sults for the limiting distribution of a pop- 
ulation in the case of 1-point crossover and 
"weak" selection, where we define quantita- 
tively "weak". We also derive an exact, ana- 
lytic form for the population distribution as 
a function of time for a flat landscape and 
1-point crossover. 



1 Introduction 

Developing a better qualitative and quantitative un- 
derstanding of the theory of genetic dynamics, and 
thereby of Genetic Algorithms (GAs), remains a chal- 
lenging problem by any measure. The benefits of hav- 
ing a better understanding are almost too obvious to 
mention. However, it is worth emphasizing the scale of 
the task. Almost all known theoretical results are for 
systems with a small number of loci and even in the 
case of very simple landscapes very little is known as 
far as explicit rather than formal, or numerical results 
is concerned. In particular we are unaware of any sys- 
tematic schemes for developing approximate solutions 
such as perturbative methods. 

Passing beyond canonical results, such as Holland's 
Schema theorem ]lj], which is associated with an in- 
equality for the dynamics rather than an equality, var- 
ious exact evolution equations have been derived pre- 



viously: Goldberg and Bridges [g] wrote down exact 
equations for two-bit problems. Later these equa- 
tions were extended to three and four-bit problems 
S. These equations allowed for an explicit analysis 
of string gains and losses. Whitley and Crabbe Q 
also presented an algorithm for generating evolution 
equations for larger problems that was equivalent to 
the earlier equation of [0 . Although exact these equa- 
tions are extremely unwieldy and it is difficult to infer 
general conclusions from their analysis. 

Another related approach is that of Vose and collabo- 
rators H H, Q that treats GA evolution as a Markov 
chain. Noteworthy, and less familiar in the GA com- 
munity, is analogous work in mathematical evolution 
theory | p[ p| (see also the significant article of Al- 



tenberg |10| that has a foot in both camps). This 
formulation of GA dynamics appears to be so removed 
from elements such as the Schema theorem and Build- 
ing Block hypothesis that the latter have very much 
fell out of favour with aficionados of this dynamical for- 
mulation. Also, being fundamental, microscopic equa- 
tions they do not lend themselves easily to a treatment 
of schemata. It is important to note that this is not 
just an academic point: genetic dynamics with a large 
number of degrees of freedom is so complex that a for- 
malism that treats the effective degrees of freedom at 
a more macroscopic level is certainly required. Such 
a formalism could be postulated directly, such as in 
the effective theory of Shapiro and collaborators pj| , 
or better, derived from the underlying microscopic dy- 
namics. 

Evolution equations that offer the benefit of a very in- 
tuitive interpretation, that illuminate the content of 
the Schema theorem and the Building Block hypoth- 
esis, that naturally coarse grain from string equations 
to schema equations, that yield an interpolation be- 
tween the microscopic and the macroscopic and that 
offer new exact results or simpler proofs of known re- 
sults have been recently derived 12 [u| [uj. These 



equations lead to many insights into the dynamics of 
GAs offering an exact Schema theorem that naturally 
incorporates a form of the Building Block hypothesis. 
Originally developed for a canonical GA (proportional 
selection, 1-point crossover and mutation) the basic 
elements have also recently been extended to Genetic 
Programming (GP) by Poli and coworkers [l6| who 
have showed the utility of the formalism by deriving 
several interesting new results. 

In this paper we generalize the formalism of [fl2l , 
[b| [l4|, to arbitrary selection schemes and a general 
crossover operator showing that the basic advantages 
of the former are preserved. We see that evolution 
can be fruitfully viewed as the hierarchical construc- 
tion of more and more complex building blocks. The 
usefulness of crossover in aiding search is governed by 
linkage disequilibrium coefficients associated with se- 
lection probabilities rather than gene frequencies. We 
consider the limit distributions of a population show- 
ing that in the long time limit an arbitrary schema 
reaches Robbins proportions but with a coefficient dif- 
ferent than one. We also derive an exact, analytic form 
for the dynamics of a GA for a flat landscape and 1- 
point crossover. 

2 Microscopic String Evolution 
Equations 

In this section we recapitulate the results of JlJ, |l3|, 
, taking the opportunity to show their explicit gen- 
eralization to a general crossover operator and arbi- 
trary selection. We start with an exact evolution 
equation that relates string proportions, P(c,-,t) = 
n(Ci,t)/n, for a genotype Cj consisting of binary al- 
leles, at time t to those at t + 1, where n(Ci,t) is the 
number of strings Cj in the population at time t and n 
is the population size. In the limit n — > oo the P(Ci, t) 
give the probability distribution for the population dy- 
namics and satisfy 

P(c i ,t + l)=V(o i )P c (c i ,t) 

+ E V{ Cj ^)P c {c 3 ,t) (1) 

c^c, 

where, for an infinite population, P c (Ci,t) is the ex- 
pected proportion of strings of type Cj after selection 
and crossover. 

The effective mutation coefficients are: V{c.i) = 
rifc=i(l -Pm(fc))i which is the probability that string 
% remains unmutated, and 7 3 (c J -.c i ), the probability 
that string j is mutated into string i given by 

v(c^ Ci )= n Pm (k) n Q.-pm(k)) (2) 



where p m (k) is the mutation probability of bit k. For 
simplicity we assume it to be constant, though the 
equations are essentially unchanged if we also include 
a dependence on time. {Cj-d} is the set of bits that 
differ between Cj and Cj and {Cj-Cfic, the complement 
of this set, is the set of bits that are the same. In the 
limit where p m is uniform, V{o^j = (1 — Pm) an d 

V{c^ Ci ) = pC (i ' j) (l-Pm) N - d " {i ' j \ where d H (i,j) 
is the Hamming distance between the strings Cj and 
Cj . Note that for a finite population the left hand side 
of (P is the expected proportion of genotype c\ to be 
found at t + 1 while any P(d,t) on the right hand side 
are to be considered as the actual proportions found 
at t. 

Explicitly P c (Ci,t) is given by 

P c (c i ,t)=P'{c i ,t) 

2™ 

-5>(m) E c[f c »P'(c„i)P'( Cj ,i) 

m=l Cj^Ci 

2« 

+X>(m) E E C^ Cl {m)P'{c 3 ,t)P'{c h t) (3) 

m=l Cj^Cj C^Ci 

where YT =1 i s the sum over all possible crossover 
masks m E M, M being the space of masks, and 
Pc{m) is the probability to implement the mask m. 
P'(Ci, t) is the probability that genotype C{ is selected 
and the coefficients C^ Cj (m) and c ,(ro), represent 
the probabilities that, given that Ci was one of the 
parents, it is destroyed by the crossover process, and 
the probability that given that neither parent was Ci 
it is created by the crossover process. 

In order to write these probabilities more explicitly we 
denote the set of alleles inherited by an offspring from 
parent Cj as S and the alleles inherited from parent 
Cu i.e. the set Ci — S, by C. Naturally, S and C 
both depend on the particular crossover mask chosen. 
Then, 

C^ Cj {m) = e{d^{i,j))e{df{i,j)) (4) 

and 

C$7, (m) = lW%(i,j))6(d?(i,l)) 

+6(d?(t,3))5(df(tJ))} (5) 

where d^(i,j) is the Hamming distance between the 
strings Ci and Cj measured only over the set S, with 
the other arguments in (Q) and (||) being similarly 
defined. 8(x) = 1 for x > and is for x = 0, 
whilst 5{x) =0 Vx ^ and (5(0) = I. Note that 
Cc^Cj ( TO ) an d C<^ C; ( m ) are properties of the crossover 







process itself and therefore population independent. 
The equations (|j) and (||) yield an exact expression 
for the expectation values, n(Ci,t), and in the limit 
ri — > oo yield the correct probability distribution gov- 
erning the GA evolution for arbitrary selection, muta- 
tion and crossover. It takes into account exactly the 
effects of destruction and construction of strings and, 
at least at the formal level, is either a generalization 
of or is equivalent to other exact formulations of GA 
dynamics |, | | | | 

As an explicit example, consider 1-point crossover. In 
this case there are only N— 1 pertinent crossover masks 
labelled by the crossover point k. Then, p c (m) = 
p c /N — 1 for m = A;, k G [1, N - 1] and p c (m) = oth- 
erwise. Also, S — L and C — R (or vice versa) where 
L and R refer to the parts of the string to the left and 
right of the crossover point respectively. For 2-point 
crossover there are N C 2 non-zero masks labelled 
by two crossover points, k\ and k%. Then p c {m) = 
p c / N - 1 C 2 for to e {ki,k 2 } with fci, k 2 e [1,N - 1] 
and ki > k 2 . In this case S represents the parts of 
the string outside k\ and k 2 and C the part between 
them (or vice versa). As a final example, for uniform 
crossover p c (m) = p c fi Ns (1 — /j,) N ~ Ns /2 N where /j is 
the probability that a given allele is inherited from 
parent Cj , S being the set of alleles inherited from Cj . 

Computationally, the above representation is very re- 

(2) 

dundant. The matrix C c c (m) has dimensionality 
(2 N - 1) x (2 N - 1) but only (2 Ns - 1) x (2^ - 1) 
matrix elements are non-zero where N s is the number 
of alleles of Cj inherited from one parent and N c the 
number from the other. As a consequence, and also 
for other reasons, iterating the dynamics is exceedingly 
difficult. The equations (HJ) and (||) offer little in terms 
of intuitive understanding of the evolutionary dynam- 
ics and, arguably, not a great deal in terms of formal 
results, though Vose and coworkers have made impor- 
tant contributions in this area. Intuitive elements of 
GA theory such as the Building block hypothesis seem 
to play no role here and are not apparent in any de- 
gree. Additionally, a more general formulation giving 
the dynamics of schemata is relatively unnatural in 
this formulation. 

3 Coarse Grained Evolution 
Equations 

Far greater progress can be made by changing basis 
from a microscopic string basis to a coarse-grained ba- 
sis associated with schemata. Such a basis emerges 
very naturally anyway from equation (^). 

To see this, consider first the destruction term. For a 



given crossover mask the matrix (Q) restricts the sum 
over the strings Cj to those that differ from c$ in at 
least one bit both in S and in C. One can convert the 
sum over Cj into an unrestricted sum by subtracting 
off those Cj that have df(i,j) = and/or df(i,j) = 0. 
We then use the fact that Ylc P'( c j^) = 1 to gain 
a tremendous simplification of the destruction term. 
Similarly one may write unrestricted sums for the con- 
struction term by subtracting off explicitly those terms 
that have been added. The unrestricted construction 
term then becomes 

J2 E p'{c 3 ,t)p'{ Cll t) (6) 

where cf is the part of Cj inherited from Cj, i.e. the al- 
leles S, while c\ is the part inherited from C/, i.e. the 
alleles C. Obviously, both cf and c\ are schemata. 
Noting that the extra terms originating from the con- 
version of the restricted sums to unrestricted sums in 
the destruction and construction terms exactly cancel 
one can finally rewrite P c (Ci,t) as 

P c {Ci,t)=P'(Ci,t){l-p c ) 

2 N 

+Y.Pc{™)P'{c s l (m),t)P'{c c l (m),t) (7) 

where p c = X)l=iPc(™) is the probability to imple- 
ment crossover (irrespective of the mask) and 

P'(cf(m),t)= £ P'{Cj,t) (8) 

and similarly for P'(a?(m),t). In the absence of mu- 
tation P c (Ci,t) = P(Ci,t + 1). It is important to 
note here that in this form the evolution equation 
shows that crossover explicitly introduces the idea of a 
schema and the consequent notion of a coarse graining 
relating a string Cj to its building blocks, cf and c\ , 
which are schemata of order A^ 5 and N c — N — N s 
respectively. Notice by going to this coarse-grained 
basis that the potentially (2 N — 1) x 2 N destruction 
terms, (2 N — 1) being the number of terms in the re- 
stricted string sum and 2 N being the number of pos- 
sible crossover masks, have been reduced to only one 
term and that for a given crossover mask the (2 N — l) 2 
construction terms have been converted into one sin- 
gle crossover term P'(cf(m),t)P'(c^(m),t). Actually, 
this somewhat overstates the case as the schema aver- 
ages contain (2^ — 1) and (2^ — 1) terms respectively. 
However, at the very least the change in basis has ex- 
plicitly eliminated the zero elements of C<? Cl (m) . 

So let's recapitulate how the three different operators 
- mutation, crossover and selection - enter in this ex- 
act dynamics. First of all, selection is embodied in 



the factor P' which is a "black box" for the pur- 
poses of the dynamics. That is to say, the highly 
simplified and elegant form of the dynamical equa- 
tion (0) is independent of the type of selection used 
and is a direct consequence of the form invariance of 
P' under a coarse graining. In other words the re- 
lationship between P'(d,t) at the string level and 
P'(cf,f) at the coarse grained level is strictly linear, 
i.e. P'(cf,t) = ^ CoC s P'(c„f). However, if one 
wishes to know how the functional dependence on P 
dynamically changes then one must implement a par- 
ticular selection scheme. For example, with propor- 
tional selection P'(d,t) = (/(Ci)//(f))P(c?j, f) where 
f(Ci) is the fitness of string d and /(f) is the av- 
erage population fitness. It is easy to demonstrate 
in this case that P'(cf,t) = (f(cf,t)/J(t))P(cf,t), 
where /(cf,f) is the fitness of the schema cf, which 
demonstrates the form invariance of proportional se- 
lection. Thus, the form invariance in this case also 
holds at the level of the Ps not just the P's. This 
will not generally be the case. For example with 
a "non- linear" selection scheme where P'(Cj,t) = 
a(Ci,t)P(d,t) + b(Ci,t)P 2 (Ci,t) there is no natural 
form invariance at the level of the Ps as P'(cf,t) ^ 
a(cf,t)P(cf,t) + b(cf,t)P 2 (c?,t). We also see that 
the elegance of the equation (Q) is independent of the 
type of crossover implemented. The details of the 
crossover mechanism enter in p c {m) and also deter- 
mine which alleles are included in S. 

To iterate the equation it is clear that we need to know 
the dynamics of the schemata cf and cf. Thus we 
need to obtain an equivalent equation for an arbitrary 
schema, £ of order 7V2 and defining length I, To do 
this we need to sum with Ec o{ on both sides of the 
equation ([I]). This can simply be done to obtain |l2|, 

EE 

p(e, f + 1) = v(e)p c {z, t)+J2 n^Pcdut) (9) 



where the sum is over all schemata, ^, that dif- 
fer by at least one bit from £ in one of the N2 
defining bits of £. In other words any schema 
competing with £ and belonging to the same par- 
tition. P c (£, f) = rifc=i(l — Pm{k)) is the prob- 
ability that £ remains unmutatcd and P c (£i,t) = 

Uk^-^jP^Uke^-i^M is the P^- 

ability that the schema ^ mutate to the schema £. 

-fc(£, t) = J2dD£ Pc( c ii *) 1S the probability of finding 
the schema £ after selection and crossover. Note the 
form invariance of the equation, i.e. (j^) has exactly 
the same form as (|l|) . To complete the transformation 
to schema dynamics we need the schema analog of (H) . 
This also can be obtained by acting with Eco£ on 



both sides of the equation. One obtains 

N Mr (l,N 2 " 



Pc(£,t) = (1-Pc 



+ Pc(m)P'(Z s (m),t)P'(e(m),t) (10) 

mEM r (l,N 2 ) 

where £ s represents the part of the schema £ inher- 
ited from the first parent and £ c that part inher- 
ited from the second. N Mr (l, N2) is the number of 
crossover masks that affect £, M. r being the set of 
such masks. N M is the total number of masks with 
Pc{m) 7^ 0. Obviously these quantities depend on the 
type of crossover implemented and on properties of the 
schema such as defining length. 

Once again for the purposes of illustration we may con- 
sider some specific examples. For 1-point crossover: 
N M = N - 1, N Mr = I - 1, E meMc fc(m) - 

p c /N — 1 y\1.\i where k is the crossover point, and 
£ s = £ L (fc) the part of £ to the left of the crossover 
point and £ c = £ fl (fc) is the part to the right. Substi- 
tuting into @ one finds the results of ||, [if, [lj] . For- 
m-point crossover: N M = N ~ 1 C m , N M _ r — ( N ~ 1 C m ~ 
N -'C m ) and E m£Mr .Pc( m ) -> PcJ2 kl>k2> ... >km , where 
fei, k m are the m crossover points. 

4 Schema Theorem 

With the exact evolution equations in hand we can 
extend the exact Schema theorem of [|l2| [l3|, |lj] to 
a general crossover operator and arbitrary selection 
scheme. Once again we state it through the concept 
of effective fitness ||, £i| 

Exact "coarse grained" Schema theorem 



P(U + i) = ^f^P(U) 



(11) 



where 



E 



= (v(e)P c (t,t) 
/(*) 



<)Pedi,t) 



(12) 



and P c (£,f) is given by equation (|i0|). The interpre- 
tation of this equation is as for its analog for 1-point 
crossover and proportional selection jl2| |l3|, 0: that 
schemata of higher than average effective fitness are al- 
located an "exponentially" increasing number of trials 
over time. An intuitive version of the Building Block 
hypothesis is part and parcel of this Schema theorem: 



(10) explicitly shows the competition between schema 
destruction and construction and shows how a schema 



is constructed from its building blocks £ s and £ c which 
in their turn are composed of building blocks £ s - s and 
£ sc and £ c ' s and £ cc respectively, which in their turn... 
The particular building blocks depend of course on the 
particular mask. Note that the crossover dependent 
part of (|l^) for a given crossover mask can be written 
as 

A(£, m) = P'(£, t) - P'{i s {m), f)P'(f (m), t) (13) 

which is a selection weighted linkage disequilibrium 
coefficient and, other than in a flat landscape, is a 
more relevant quantity than the standard linkage dis- 
equilibrium coefficient which is associated with the 
Ps rather than the P's. If A > then crossover 
has a negative effect, i.e. schema destruction dom- 
inates while if A < crossover is positive. Thus, 
whether the selection probabilities for the building 
blocks of a given schema are correlated or anticor- 
related determines the nature of crossover. It is im- 
portant to note that the hierarchical structure inher- 
ent in the iterative solution of (0) relating a string or 
schema to its more coarse grained antecedents termi- 
nates after N% steps when one arrives at the maxi- 
mally coarse grained effective degrees of freedom - 1- 
schemata. These play a priviliged role as they cannot 
be destroyed by crossover and in the continuous time 
limit satisfy P(£«,t) = exp /„*((/(£, *')//(*)) - l)dt' . 

One of the strengths of the present coarse grained for- 
mulation is that, as we shall see, much can be deduced 
simply by inspection of the basic formulas. We shall 
first of all put the basic equation into a yet more ele- 
gant form. We introduce a 2 Ar -dimensional population 
vector, P(t), whose elements are P(Ci, t), i = 1, 2 . 
Equation (|IJ) can then be written in the form 

P(f + l) = WP c (i) (14) 

where the mutation matrix W is real, symmetric and 
time independent and has elements W,j = pti (1 — 
pjN—d ^ wnere f or simplicity we restrict now to 
the case of uniform mutation (the generalization to 
non-uniform mutation is straightforward). Restrict- 
ing attention to the case of selection schemes linear in 
P(Ci,t), P G (t) can be written as 

2 N 

P c (t) = F(f)P(t) + J2 Pc( m )i(™, t) (15) 

TCI— 1 

where the selection - crossover destruction matrix, 
F(t), is diagonal, and takes into account selection and 
the destructive component of crossover. Explicitly, for 
proportional selection Fu(t) = (f(c l )/f(t))(l - p c ). 
Finally, the "source" matrix is given by j(m, t) = 



P'(cf(m),t)P'(Ci(m),t). Defining the selection- 
crossover destruction- mutation matrix W s (t) = 
WF(t) we have 

2 N 

P(t+l)=W.(f)P(t) + X>c(m)Wj(m,t) (16) 

m— 1 

The interpretation of this equation is that j(m,f) is a 
source which creates strings (or schemata) by bring- 
ing building blocks together while the first term on 
the right hand side tells us how the strings themselves 
are propagated into the next generation, the destruc- 
tive effect of crossover renormalizing the fitness of the 
strings. 

5 Formal Solutions of the Basic 
Equations 

Needless to say solutions of these dynamical equations 
are hard to come by. They represent, for binary al- 
leles, 2 N coupled non-linear difference equations, or 
in the continuous time limit - differential equations. 
As shown in |l2|, |l3| , compared to a representa- 
tion based on (|3|), even a formal solution of (0) in 
the absence of mutation and for 1-point crossover and 
proportional selection yields much valuable qualitative 
information, such as a simple proof of Geiringer's the- 
orem |l7| and an extension of it to the weak selection 
regime. Here we extend this formal solution to the 
case of general crossover and mutation and for any se- 
lection scheme linear in P(Ci, t). The iterated solution 
of © is 

t-i 

pw= n w » p (°) 

71=0 

2 N t-1 t-1 

+ E^( m )E II W,(i)Wj(m,n) (17) 

m—l n—Oi—n^rl 

This solution actually holds true for arbitrary 
schemata. The only changes are that the vectors are 
of dimension 2 2 , the matrices of dimension 2 N2 x 2 2 , 
the sum over masks for the construction terms is only 
over the set A4 r and that the building blocks in j(m, t) 
are those of the schema rather than the entire string. 

The interpretation of ( |l7|) follows naturally from that 
of jl^). Considering first the case without mutation, 
the first term on the right hand side gives us the prob- 
ability for propagating a string or schema from t = 
to t without being destroyed by crossover. In other 
words I7*o ^ s ( n ) 1S ^ ne Greens function or propa- 
gator for P. In the second term, j(m, n), each element 
is associated with the creation of a string or schema 



at time n via the juxtaposition of two building blocks 
associated with a mask m. The factor IIi=nWs(*) 
is the probability to propagate the resultant string or 
schema without crossover destruction from its creation 
at time n to t. The sum over masks and n is simply 
the sum over all possible creation events in the dy- 
namics. This formulation lends itself very naturally to 
a diagrammatic representation that will be discussed 
elsewhere. The role of mutation is to mix the strings 
or schemata created in the aforementioned process. 

So, what can be deduced from (|l7|)? First of all let's 
consider the case of a flat fitness landscape with no 
mutation; then P(i) = P(i) and W is the unit matrix, 
1, and W s = (1 —p c )l. Then, for an arbitrary schema 



crossover, where m > m' 



P(t) = (l-p, 
N 



N Mr (l,N 2Jv 
N M 



P(0) 



+ (1-Pc 



N, 



^(l-p c -^^)- {n+1) i(rn,n) 
n=0 ^ M 



(18) 



Pc N MAi,y y = o hence 



Now, obviously limt_ >0 o(l 

P(t) — * as t — > oo unless the summation over time 
leads to a cancellation of this damping factor. Given 
that the building block constituents of j(m,ro) are as- 
sociated with damping factors (1 — p c — ^j^f- — )* and 



(1- 



-Pc NMr ^ 1 N2 ^ Y this can only occur if there is 

no damping of the consituent building blocks and this 
only happens if they are 1-schemata as N Mr (l, 1) = 0, 
i.e. you can't cut a 1-schema. Given that it's a 
flat landscape P^ {z) ,t) = P(£ w ,0) where P(£ w ,i) 
is the probability of finding the 1-schema £M corre- 
sponding to the bit i, hence the only term of t) 
which gives a non-zero contribution when integrated 
is JT^ 2 i P(£ {i \ 0). Thus the limit distribution is 



No 



P*(0= lim P(€,t) = TT P ^ W ' ) 



(19) 



which is Geiringer's theorem for a general crossover 
operator. 

The only role the type of crossover is playing here is 
how fast the transient corrections to the limit distribu- 
tion die out. The damping is controlled by N Mr (I, N 2 ) , 
hence the bigger it is the faster the corresponding tran- 
sient dies out. For example, for entire strings (or 
schemata of defining length N) for m and m'-point 



Nj™)(l,N 2 ) 
N ( ™P(l,N 2 ) 



1 - 



r(N-l+l)r(N-m) 
r{N-l-m+l)r{N) 



1 



r(N- 

T{N~ 



l+l)r(N-m') 
l-m' + l)r{N) 



> 1 (20) 



where T(N) = (N — 1)!. Hence, we can quantify ex- 
actly how quickly m-point crossover transients die off 
relative to m'-point crossover transients. 

We can take this further and generalize Geiringer's 
theorem to non-flat fitness landscapes for selection 
schemes linear in P(d,t) such as proportional se- 
lection and for 1-point crossover and without muta- 
tion. Without loss of generality we write P'(£, £) = 
(1 + eJ(£, t)) where e will serve as a control parame- 
ter to determine how "weak" selection is, e<5(£, t) being 
the selection pressure for the schema £. Weak selection 
will imply e|5(£, t)| < 1 V£, t. A 1-schema evolves as 



f\ = p eA(£W ,t,t') prg{i) ^\ 



P(^\t)=C< 



(21) 



where A(£«,i,i') = J*S(^\t")dt" is the integrated 
selection pressure for the 1-schema. 

Theorem 

The limit distribution, P(£)*, o/P(£,i) as t — > oo 
for "weak" selection and no mutation and 1-point 
crossover in the continuous time limit is 



No 



P(0*=A(^N,l, Pc )l[P(^,0) 



(22) 



where the selection dependent amplitude A(£, N,l,p c ) 
is given by 



lim 

t — >oc 



Pc(i-l)t 
AT— 1 



N 2 



A(£,N,l,p c ): 
e -r e (l-^#^)A( ? ,M-^) 

N2 AI*M * 



Y[(l + S(^\t- _i^ T ))eE i = 1 Att w .*-3&.°)d T (23) 
i=i 

Note that for a flat landscape A — ► 1. Due to space 
limitations here a proof of this theorem will be pre- 
sented elsewhere, though we can outline the logic. The 
key to the theorem is deciding just how weak selec- 
tion must be in order that all higher order schema 
transients die out. It is for this reason that a tuning 
parameter was introduced. For a given weak selec- 
tion landscape and values of I, N and p c there ex- 
ists a critical value of e, e cr (£, N, l,p c ), below which 
all non- 1-schema transients die out as t — > oo. One 
examines the growth or decay rate of a schema and 
all its constituent building blocks. Growth is caused 



by selection and decay by crossover. By examining 
the dominant growth mode, other than that associ- 
ated with pure 1-schemata, one can tune e such that 
it becomes a decay mode. All other growing modes 
are subdominant and therefore also decay. Hence, the 
only non-zero mode in the limit t — > oo is the 1-schema 
mode of (p2|). The difference with the result for a flat 
landscape is that in this case the effective fitness land- 
scape for 1-schemata is not flat. As an example, if one 
considers a weak counting ones landscape, i.e. where 
f(Ci) = 1 + eni(Ci), n\(Ci) being the number of ones 
on the string, then e cr ~ p c /N 3 for strings. 

6 Explicit Solutions 

To show the further utility of the present formalism we 
will find the general explicit solution to (fjj) for a flat 
fitness landscape, without mutation and for 1-point 
crossover in the continuous time limit. Lest this be 
thought a trivial problem it's wise to remember that 
it still involves the solution of 2^ coupled non-linear 
differential equations. To illustrate the general princi- 
ples we'll consider first a three bit problem. The gen- 
eral structure of the equations (Q) is that one builds 
up a solution via intermediate building blocks. The 
most fundamental blocks are 1-schemata as these can- 
not be cut and hence transform trivially under re- 
combination (save in the case of uniform crossover). 
P"< N2 )(t) in this case is an A^-dimensional vector and 
p(W)(£) = p(i.JV2)( ), with / = landiVa = 1. Explic- 
itly, P(i **,*) = P(i * *,0), P(*j*,t) = P(*j*,0) and 
P(* * k,t) = P(* * k,0). There are four 2-schemata 
per crossover point corresponding to P(ij*,t) and 
P(*jk,t), i, j — 0, 1. P(ij*,t) satisfies 

P{ij*,t) = e-*¥ P(ij*,0) 
ft i 
+ Pc e -e±L / p^fc^p^^e 2 ^-^' (24) 
2 Jo 

with an analogous equation for P(*jk,t). The simple 
solution of (|JJ) is 

P(ij*,t) = e _£ ^P(r/*,0) 
+ (l-c- £ § i )P(i**,0)P(*j*,0) (25) 
The 3-schema solution is found using ( p5| ) 
P(ijk,t) = e- pat P(ijk,0) 

2 Jo 

+P(i**,t')P(*jk,t'))dt' (26) 

Substituting the solution (|25|) into (|2^) one finds sim- 
ply 

P(ijk,t) = e" Pc *P(ijfc,0) 



+e _£ ^(l - e -2 ^) (P(ij*,0)P(* * fc,0) 
+ P(i * *, 0)P(*jk, 0)) 
+(1 - e -2 ^ fP(i * *, 0)P(*j*, 0)P(* * k, 0) (27) 

In the limit t — > oo, P(ijk, t) — > P(i * 
*, 0)P(*j*, 0)P(* * k, 0). We see here the approach 
to Robbin's proportions is exponentially fast with the 
bigger schemata dying off quicker. 

The general solution for an iV-bit string is 

N-l 

P(d,t) = e"^(l-e-* i r) A '-™- 1 P(n + l) (28) 

n=0 

where V(n + 1) is an initial condition and represents 
a partition over the probabilities for finding N — n 
building blocks at t = 0. For a given n there are 
N ~' L C n such terms. Equation (^?]) offers a simple il- 
lustration where N — 3, hence rt = 0, 1, 2. n = 
corresponds to the N building block terms of which 
there are N - 1 C = 1 term, P(i**, 0)P(*j*, 0)P(**fc, 0). 
n = 1 corresponds to the iV — 1 building block terms of 
which there are N ~ 1 C\ = 2 terms, P(ij*,0)P(* * fc,0) 
and P(i * *,0)P(*jk, 0). Finally, n = 2 corresponds 
to the N — 2 building block terms of which there are 
*-i<7 2 = 1 term, P(ijk,0). 

Notice that ( p8| ) gives not only the asymptotic be- 
haviour but also the complete transients. We are un- 
aware of any similar result in the literature. It is not 
difficult to prove that (|2q) is the general solution by 
showing that it satisfies (|7|). This requires the solu- 
tions of (0) for c L (k) and C R {k) 7 the building blocks 
of Ci also. From the form invariance of the equations 
the solution of (^) is 

P(cf,t) = 

l L (k)-l 

J2 e- S &£(l-e--&) tr ^ nL - 1 V(n L + l) (29) 

n L =0 

where N — til is the number of building blocks of cf(k) 
and lh{k) is the defining length of cf(k). Using the 
analogous equation for cf , (k) and the fact that n = 
n L + n R one sees that both sides of (Q) have the same 
time dependence hence it suffices to prove the equiv- 
alence at t = 0. The equality at t — hinges on the 
identity that T(n+l) = £™~i V{n L + l)V(n R + 1). 

In the case of zero mutation the other exact limit of 
the evolution equations is that of pure selection. In- 
terestingly, the solution is only exact in the case of dis- 
crete time. With these two exact limits in hand there 
should be scope for obtaining perturbative solutions to 
the evolution equations either in powers of e for weak 
selection or in powers of p c for strong selection. 



7 Conclusions 

In this article we have generalized the formalism of 
|l2] , [Tj], [l4| to cover arbitrary selection schemes and 
a general crossover operator. We have tried to em- 
phasize the advantages of this formulation and in par- 
ticular show how these advantages have concrete pay- 
offs. We believe that our coarse grained formulation 
is more intuitive in its content than others, giving an 
exact schema theorem that contains in a very obvi- 
ous manner a Building Block hypothesis. Its coarse 
grained hierarchical structure, both in terms of time 
and building block complexity, leads to a formulation 
wherein results such as Geiringer's theorem can be 
seen in such a manifestly simple way that it is essen- 
tially proof by inspection. Additionally, the way that 
selection, schema destruction and schema creation en- 
ter in different ways leads, using the same hierarchi- 
cal structure, to a version of Geiringer's theorem for 
a class of "weak" selection landscapes where we were 
able to quantify the meaning of "weak" . 

The fundamental coarse grained equation show an 
important form invariance, i.e. after passing from 
schemata of a given N2 to more coarse grained 
schemata of order A^ < N2, the resulting equations 
have exactly the same form. This is certainly not 
manifestly true of the fundamental string equations 
. This form invariance is an important property as 
it implies that if a solution can be found for one de- 
gree of coarse graining then an analogous solution can 
simply be written down for the more coarse grained 
degrees of freedom. We showed that the eficacy of the 
present formulation lies not only in its intutive appeal 
and the facility with which more general formal re- 
sults can be deduced but also in how one may derive 
explicit analytic results, as in the case of an exact so- 
lution for the evolution of strings in the case of a flat 
fitness landscape and 1-point crossover. 

We believe that our previously derived results and the 
results herein are the tip of the iceberg and that the 
present formalism may serve as a starting point for de- 
riving a whole host of similar results and beyond. To 
name just a couple of obvious ones: finding the asym- 
potic limiting distributions for other types of crossover 
and including in mutation and finding the exact dy- 
namics for other types of crossover operator One of 
the most intriguing possibilities, that we have briefly 
alluded to, is the possibility of deriving approximate 
results using a systematic approximation scheme such 
as perturbation theory. The existence of exact solu- 
tions in the flat fitness landscape and the zero crossover 
limit add weight to such a supposition as does the fact 
that the iterative solution of the equations (u\) leads 



to a diagrammatic series very similar to the Feynman 
diagrammatic series that appear in quantum field the- 
ory. 
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